Overview

Dataset statistics

Number of variables38
Number of observations17000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory4.9 MiB
Average record size in memory304.0 B

Variable types

BOOL24
NUM14

Reproduction

Analysis started2021-03-11 17:25:41.661704
Analysis finished2021-03-11 17:26:44.037209
Versionpandas-profiling v2.6.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml
Avg_Open_To_Buy is highly correlated with Credit_LimitHigh Correlation
Credit_Limit is highly correlated with Avg_Open_To_BuyHigh Correlation
Dependent_count has 1489 (8.8%) zeros Zeros
Contacts_Count_12_mon has 416 (2.4%) zeros Zeros
Total_Revolving_Bal has 6024 (35.4%) zeros Zeros
Avg_Utilization_Ratio has 6024 (35.4%) zeros Zeros

Variables

Customer_Age
Real number (ℝ≥0)

Distinct count45
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean46.262058823529415
Minimum26
Maximum73
Zeros0
Zeros (%)0.0%
Memory size132.9 KiB

Quantile statistics

Minimum26
5-th percentile34
Q141
median46
Q351
95-th percentile59
Maximum73
Range47
Interquartile range (IQR)10

Descriptive statistics

Standard deviation7.387669001
Coefficient of variation (CV)0.1596917472
Kurtosis-0.1363868563
Mean46.26205882
Median Absolute Deviation (MAD)5.899045986
Skewness-0.03254768281
Sum786455
Variance54.57765327
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[26. 26.5 28.5 30.5 32.5 ... 62.5 64.5 65.5 67.5 73. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
45 931 5.5%
 
46 924 5.4%
 
48 902 5.3%
 
47 902 5.3%
 
49 894 5.3%
 
44 892 5.2%
 
43 853 5.0%
 
50 771 4.5%
 
42 742 4.4%
 
51 714 4.2%
 
Other values (35) 8475 49.9%
 
ValueCountFrequency (%) 
26 82 0.5%
 
27 36 0.2%
 
28 31 0.2%
 
29 64 0.4%
 
30 92 0.5%
 
ValueCountFrequency (%) 
73 1 < 0.1%
 
70 1 < 0.1%
 
68 2 < 0.1%
 
67 7 < 0.1%
 
66 2 < 0.1%
 

Dependent_count
Real number (ℝ≥0)

ZEROS
Distinct count6
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.2132941176470586
Minimum0
Maximum5
Zeros1489
Zeros (%)8.8%
Memory size132.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q11
median2
Q33
95-th percentile4
Maximum5
Range5
Interquartile range (IQR)2

Descriptive statistics

Standard deviation1.220994926
Coefficient of variation (CV)0.5516641084
Kurtosis-0.5565274383
Mean2.213294118
Median Absolute Deviation (MAD)1.000238173
Skewness0.03764625176
Sum37626
Variance1.490828609
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 1.5 2.5 3.5 4.5 5. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2 5018 29.5%
 
3 4595 27.0%
 
1 3409 20.1%
 
4 2049 12.1%
 
0 1489 8.8%
 
5 440 2.6%
 
ValueCountFrequency (%) 
0 1489 8.8%
 
1 3409 20.1%
 
2 5018 29.5%
 
3 4595 27.0%
 
4 2049 12.1%
 
ValueCountFrequency (%) 
5 440 2.6%
 
4 2049 12.1%
 
3 4595 27.0%
 
2 5018 29.5%
 
1 3409 20.1%
 

Months_on_book
Real number (ℝ≥0)

Distinct count44
Unique (%)0.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean35.827411764705886
Minimum13
Maximum56
Zeros0
Zeros (%)0.0%
Memory size132.9 KiB

Quantile statistics

Minimum13
5-th percentile23
Q132
median36
Q340
95-th percentile48
Maximum56
Range43
Interquartile range (IQR)8

Descriptive statistics

Standard deviation7.409098226
Coefficient of variation (CV)0.2067997062
Kurtosis0.5523237776
Mean35.82741176
Median Absolute Deviation (MAD)5.3734009
Skewness-0.09467740172
Sum609066
Variance54.89473652
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[13. 13.5 14.5 17.5 20.5 ... 48.5 49.5 53.5 55.5 56. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
36 3509 20.6%
 
35 800 4.7%
 
37 793 4.7%
 
34 745 4.4%
 
38 704 4.1%
 
33 649 3.8%
 
39 641 3.8%
 
40 598 3.5%
 
32 565 3.3%
 
31 556 3.3%
 
Other values (34) 7440 43.8%
 
ValueCountFrequency (%) 
13 72 0.4%
 
14 19 0.1%
 
15 41 0.2%
 
16 42 0.2%
 
17 47 0.3%
 
ValueCountFrequency (%) 
56 105 0.6%
 
55 59 0.3%
 
54 62 0.4%
 
53 98 0.6%
 
52 93 0.5%
 

Total_Relationship_Count
Real number (ℝ≥0)

Distinct count6
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.4386470588235296
Minimum1
Maximum6
Zeros0
Zeros (%)0.0%
Memory size132.9 KiB

Quantile statistics

Minimum1
5-th percentile1
Q12
median3
Q35
95-th percentile6
Maximum6
Range5
Interquartile range (IQR)3

Descriptive statistics

Standard deviation1.514407326
Coefficient of variation (CV)0.4404078987
Kurtosis-0.9617286988
Mean3.438647059
Median Absolute Deviation (MAD)1.289251979
Skewness0.1097828752
Sum58457
Variance2.293429548
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[1. 2.5 3.5 4.5 5.5 6. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
3 4228 24.9%
 
4 3160 18.6%
 
2 3077 18.1%
 
5 2641 15.5%
 
6 1976 11.6%
 
1 1918 11.3%
 
ValueCountFrequency (%) 
1 1918 11.3%
 
2 3077 18.1%
 
3 4228 24.9%
 
4 3160 18.6%
 
5 2641 15.5%
 
ValueCountFrequency (%) 
6 1976 11.6%
 
5 2641 15.5%
 
4 3160 18.6%
 
3 4228 24.9%
 
2 3077 18.1%
 

Months_Inactive_12_mon
Real number (ℝ≥0)

Distinct count7
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.361
Minimum0
Maximum6
Zeros79
Zeros (%)0.5%
Memory size132.9 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median2
Q33
95-th percentile4
Maximum6
Range6
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.9146641817
Coefficient of variation (CV)0.3874054137
Kurtosis1.336978945
Mean2.361
Median Absolute Deviation (MAD)0.7463731765
Skewness0.5187666092
Sum40137
Variance0.8366105653
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 0.5 1.5 3.5 4.5 6. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2 6599 38.8%
 
3 6521 38.4%
 
1 2774 16.3%
 
4 657 3.9%
 
5 246 1.4%
 
6 124 0.7%
 
0 79 0.5%
 
ValueCountFrequency (%) 
0 79 0.5%
 
1 2774 16.3%
 
2 6599 38.8%
 
3 6521 38.4%
 
4 657 3.9%
 
ValueCountFrequency (%) 
6 124 0.7%
 
5 246 1.4%
 
4 657 3.9%
 
3 6521 38.4%
 
2 6599 38.8%
 

Contacts_Count_12_mon
Real number (ℝ≥0)

ZEROS
Distinct count7
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.5194117647058825
Minimum0
Maximum6
Zeros416
Zeros (%)2.4%
Memory size132.9 KiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median3
Q33
95-th percentile4
Maximum6
Range6
Interquartile range (IQR)1

Descriptive statistics

Standard deviation1.020456514
Coefficient of variation (CV)0.4050376077
Kurtosis0.2592335282
Mean2.519411765
Median Absolute Deviation (MAD)0.8400098962
Skewness0.02904054108
Sum42830
Variance1.041331497
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0. 1.5 2.5 3.5 4.5 5.5 6. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
3 6319 37.2%
 
2 5767 33.9%
 
4 2060 12.1%
 
1 2038 12.0%
 
0 416 2.4%
 
5 339 2.0%
 
6 61 0.4%
 
ValueCountFrequency (%) 
0 416 2.4%
 
1 2038 12.0%
 
2 5767 33.9%
 
3 6319 37.2%
 
4 2060 12.1%
 
ValueCountFrequency (%) 
6 61 0.4%
 
5 339 2.0%
 
4 2060 12.1%
 
3 6319 37.2%
 
2 5767 33.9%
 

Credit_Limit
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count12245
Unique (%)72.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean8429.557029441803
Minimum1438.3
Maximum34516.0
Zeros0
Zeros (%)0.0%
Memory size132.9 KiB

Quantile statistics

Minimum1438.3
5-th percentile1438.3
Q12394.75
median4345
Q310585.5
95-th percentile34516
Maximum34516
Range33077.7
Interquartile range (IQR)8190.75

Descriptive statistics

Standard deviation9079.577616
Coefficient of variation (CV)1.077112069
Kurtosis1.984349842
Mean8429.557029
Median Absolute Deviation (MAD)6801.361005
Skewness1.714754119
Sum143302469.5
Variance82438729.69
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 1438.3 1438.49040897 1461.16369822 1633.72530363 1708.3485065 ... 23978.80982524 23982.08873932 24942.5 34513.58269354 34516. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1438.3 969 5.7%
 
34516 860 5.1%
 
9959 20 0.1%
 
15987 18 0.1%
 
23981 14 0.1%
 
2490 11 0.1%
 
6224 11 0.1%
 
3735 11 0.1%
 
7469 10 0.1%
 
2069 8 < 0.1%
 
Other values (12235) 15068 88.6%
 
ValueCountFrequency (%) 
1438.3 969 5.7%
 
1438.680818 1 < 0.1%
 
1438.684376 1 < 0.1%
 
1438.883479 1 < 0.1%
 
1439 2 < 0.1%
 
ValueCountFrequency (%) 
34516 860 5.1%
 
34511.16539 1 < 0.1%
 
34496 1 < 0.1%
 
34482.63812 1 < 0.1%
 
34466.85051 1 < 0.1%
 

Total_Revolving_Bal
Real number (ℝ≥0)

ZEROS
Distinct count2378
Unique (%)14.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean957.4448823529411
Minimum0
Maximum2517
Zeros6024
Zeros (%)35.4%
Memory size132.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median932
Q31694
95-th percentile2517
Maximum2517
Range2517
Interquartile range (IQR)1694

Descriptive statistics

Standard deviation888.6121461
Coefficient of variation (CV)0.9281078864
Kurtosis-1.333215927
Mean957.4448824
Median Absolute Deviation (MAD)792.0878032
Skewness0.293262704
Sum16276563
Variance789631.5462
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.0000e+00 5.0000e-01 4.5550e+02 6.7950e+02 8.5850e+02 ... 1.8235e+03 2.0575e+03 2.4095e+03 2.5165e+03 2.5170e+03], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 6024 35.4%
 
2517 933 5.5%
 
1474 14 0.1%
 
1590 14 0.1%
 
1845 13 0.1%
 
1480 13 0.1%
 
1560 13 0.1%
 
1434 12 0.1%
 
1664 12 0.1%
 
1433 12 0.1%
 
Other values (2368) 9940 58.5%
 
ValueCountFrequency (%) 
0 6024 35.4%
 
1 1 < 0.1%
 
4 2 < 0.1%
 
5 1 < 0.1%
 
6 2 < 0.1%
 
ValueCountFrequency (%) 
2517 933 5.5%
 
2516 6 < 0.1%
 
2515 1 < 0.1%
 
2514 4 < 0.1%
 
2513 3 < 0.1%
 

Avg_Open_To_Buy
Real number (ℝ≥0)

HIGH CORRELATION
Distinct count13147
Unique (%)77.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7472.027284078642
Minimum3.0
Maximum34516.0
Zeros0
Zeros (%)0.0%
Memory size132.9 KiB

Quantile statistics

Minimum3
5-th percentile461.9254739
Q11438.3
median3477.49569
Q39643.571061
95-th percentile32198.50283
Maximum34516
Range34513
Interquartile range (IQR)8205.271061

Descriptive statistics

Standard deviation9087.222768
Coefficient of variation (CV)1.21616563
Kurtosis1.973823122
Mean7472.027284
Median Absolute Deviation (MAD)6808.835671
Skewness1.710449719
Sum127024463.8
Variance82577617.64
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[3.00000000e+00 2.07846248e+02 3.86500000e+02 1.04939578e+03 1.16635391e+03 ... 3.19930000e+04 3.20014529e+04 3.38088534e+04 3.45135827e+04 3.45160000e+04], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
1438.3 715 4.2%
 
34516 218 1.3%
 
31999 42 0.2%
 
787 8 < 0.1%
 
713 7 < 0.1%
 
953 7 < 0.1%
 
463 7 < 0.1%
 
701 7 < 0.1%
 
788 6 < 0.1%
 
1677 6 < 0.1%
 
Other values (13137) 15977 94.0%
 
ValueCountFrequency (%) 
3 1 < 0.1%
 
5.343906764 1 < 0.1%
 
10 1 < 0.1%
 
12.3413654 1 < 0.1%
 
13.48612786 1 < 0.1%
 
ValueCountFrequency (%) 
34516 218 1.3%
 
34511.16539 1 < 0.1%
 
34505.72649 1 < 0.1%
 
34500.06797 1 < 0.1%
 
34496.46368 1 < 0.1%
 

Total_Amt_Chng_Q4_Q1
Real number (ℝ≥0)

Distinct count8008
Unique (%)47.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.7338537384106416
Minimum0.0
Maximum3.397
Zeros5
Zeros (%)< 0.1%
Memory size132.9 KiB

Quantile statistics

Minimum0
5-th percentile0.4276631959
Q10.605
median0.722
Q30.85
95-th percentile1.038012071
Maximum3.397
Range3.397
Interquartile range (IQR)0.245

Descriptive statistics

Standard deviation0.2087288077
Coefficient of variation (CV)0.2844283496
Kurtosis7.84793204
Mean0.7338537384
Median Absolute Deviation (MAD)0.1536681845
Skewness1.234033314
Sum12475.51355
Variance0.04356771515
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 1.32664576e-03 1.93561754e-01 2.99330632e-01 3.56767155e-01 ... 1.32850000e+00 1.52800000e+00 1.74950000e+00 2.36250000e+00 3.39700000e+00], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.791 36 0.2%
 
0.743 34 0.2%
 
0.712 34 0.2%
 
0.718 33 0.2%
 
0.735 33 0.2%
 
0.744 32 0.2%
 
0.699 32 0.2%
 
0.722 32 0.2%
 
0.69 31 0.2%
 
0.717 31 0.2%
 
Other values (7998) 16672 98.1%
 
ValueCountFrequency (%) 
0 5 < 0.1%
 
0.002653291513 1 < 0.1%
 
0.01 1 < 0.1%
 
0.018 1 < 0.1%
 
0.02195488155 1 < 0.1%
 
ValueCountFrequency (%) 
3.397 1 < 0.1%
 
3.355 1 < 0.1%
 
2.675 1 < 0.1%
 
2.594 1 < 0.1%
 
2.368 1 < 0.1%
 

Total_Trans_Amt
Real number (ℝ≥0)

Distinct count6174
Unique (%)36.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3879.254117647059
Minimum510
Maximum18484
Zeros0
Zeros (%)0.0%
Memory size132.9 KiB

Quantile statistics

Minimum510
5-th percentile1128.95
Q12016
median2655.5
Q34589
95-th percentile9406.25
Maximum18484
Range17974
Interquartile range (IQR)2573

Descriptive statistics

Standard deviation3066.537381
Coefficient of variation (CV)0.7904966492
Kurtosis4.822334589
Mean3879.254118
Median Absolute Deviation (MAD)2127.988744
Skewness2.139417109
Sum65947320
Variance9403651.511
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 510. 642.5 682.5 952.5 1190.5 ... 13072. 13606.5 15589.5 16736.5 18484. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
2465 16 0.1%
 
2299 16 0.1%
 
2510 15 0.1%
 
2473 15 0.1%
 
2492 15 0.1%
 
2312 14 0.1%
 
1865 14 0.1%
 
2505 14 0.1%
 
2212 14 0.1%
 
2229 14 0.1%
 
Other values (6164) 16853 99.1%
 
ValueCountFrequency (%) 
510 1 < 0.1%
 
518 1 < 0.1%
 
530 2 < 0.1%
 
563 2 < 0.1%
 
569 1 < 0.1%
 
ValueCountFrequency (%) 
18484 1 < 0.1%
 
17995 1 < 0.1%
 
17744 1 < 0.1%
 
17634 1 < 0.1%
 
17628 1 < 0.1%
 

Total_Trans_Ct
Real number (ℝ≥0)

Distinct count126
Unique (%)0.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean56.71011764705882
Minimum10
Maximum139
Zeros0
Zeros (%)0.0%
Memory size132.9 KiB

Quantile statistics

Minimum10
5-th percentile26
Q140
median51
Q373
95-th percentile95
Maximum139
Range129
Interquartile range (IQR)33

Descriptive statistics

Standard deviation22.42036382
Coefficient of variation (CV)0.3953503316
Kurtosis-0.04565179241
Mean56.71011765
Median Absolute Deviation (MAD)18.83581309
Skewness0.5911287199
Sum964072
Variance502.6727137
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[ 10. 13.5 16.5 24.5 29.5 ... 94.5 106.5 124.5 131.5 139. ], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
43 527 3.1%
 
40 494 2.9%
 
42 490 2.9%
 
41 473 2.8%
 
39 469 2.8%
 
44 463 2.7%
 
45 419 2.5%
 
38 419 2.5%
 
46 354 2.1%
 
37 347 2.0%
 
Other values (116) 12545 73.8%
 
ValueCountFrequency (%) 
10 6 < 0.1%
 
11 6 < 0.1%
 
12 10 0.1%
 
13 17 0.1%
 
14 33 0.2%
 
ValueCountFrequency (%) 
139 1 < 0.1%
 
138 1 < 0.1%
 
134 1 < 0.1%
 
132 1 < 0.1%
 
131 6 < 0.1%
 

Total_Ct_Chng_Q4_Q1
Real number (ℝ≥0)

Distinct count7658
Unique (%)45.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.6486859173999783
Minimum0.0
Maximum3.714
Zeros7
Zeros (%)< 0.1%
Memory size132.9 KiB

Quantile statistics

Minimum0
5-th percentile0.318174326
Q10.4997067615
median0.64
Q30.773
95-th percentile1
Maximum3.714
Range3.714
Interquartile range (IQR)0.2732932385

Descriptive statistics

Standard deviation0.232729592
Coefficient of variation (CV)0.3587708408
Kurtosis11.93366815
Mean0.6486859174
Median Absolute Deviation (MAD)0.1706933549
Skewness1.665810341
Sum11027.6606
Variance0.054163063
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 2.07069047e-04 1.03353160e-01 2.10853324e-01 2.49981879e-01 ... 1.20024877e+00 1.36550000e+00 1.65111139e+00 2.53550000e+00 3.71400000e+00], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0.667 171 1.0%
 
1 171 1.0%
 
0.5 169 1.0%
 
0.75 158 0.9%
 
0.6 115 0.7%
 
0.8 101 0.6%
 
0.714 92 0.5%
 
0.833 85 0.5%
 
0.778 69 0.4%
 
0.625 63 0.4%
 
Other values (7648) 15806 93.0%
 
ValueCountFrequency (%) 
0 7 < 0.1%
 
0.0004141380933 1 < 0.1%
 
0.01144765327 1 < 0.1%
 
0.0162513849 1 < 0.1%
 
0.01751456843 1 < 0.1%
 
ValueCountFrequency (%) 
3.714 1 < 0.1%
 
3.571 1 < 0.1%
 
3.5 1 < 0.1%
 
3.25 1 < 0.1%
 
3 2 < 0.1%
 

Avg_Utilization_Ratio
Real number (ℝ≥0)

ZEROS
Distinct count4225
Unique (%)24.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.22810776607440256
Minimum0.0
Maximum0.9990000000000001
Zeros6024
Zeros (%)35.4%
Memory size132.9 KiB

Quantile statistics

Minimum0
5-th percentile0
Q10
median0.094
Q30.4139462954
95-th percentile0.8026073976
Maximum0.999
Range0.999
Interquartile range (IQR)0.4139462954

Descriptive statistics

Standard deviation0.2764862316
Coefficient of variation (CV)1.212086008
Kurtosis-0.274460143
Mean0.2281077661
Median Absolute Deviation (MAD)0.2334383113
Skewness1.02183859
Sum3877.832023
Variance0.07644463628
Histogram with fixed size bins (bins=10)
Histogram with variable size bins (bins=[0.00000000e+00 2.73981450e-05 2.19849688e-02 2.20188257e-02 2.29653234e-02 ... 8.73359608e-01 8.96926042e-01 8.97082766e-01 9.25277680e-01 9.99000000e-01], "bayesian blocks" binning strategy used)
ValueCountFrequency (%) 
0 6024 35.4%
 
0.073 62 0.4%
 
0.057 33 0.2%
 
0.048 32 0.2%
 
0.059 31 0.2%
 
0.06 30 0.2%
 
0.045 29 0.2%
 
0.061 29 0.2%
 
0.069 28 0.2%
 
0.053 27 0.2%
 
Other values (4215) 10675 62.8%
 
ValueCountFrequency (%) 
0 6024 35.4%
 
5.479628996e-05 1 < 0.1%
 
0.0001552943402 1 < 0.1%
 
0.0001852195268 1 < 0.1%
 
0.0001893264982 1 < 0.1%
 
ValueCountFrequency (%) 
0.999 1 < 0.1%
 
0.9980488494 1 < 0.1%
 
0.995 1 < 0.1%
 
0.994 1 < 0.1%
 
0.9934390897 1 < 0.1%
 

Gender_F
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
8934
1
8066
ValueCountFrequency (%) 
0 8934 52.6%
 
1 8066 47.4%
 

Gender_M
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
10542
1
6458
ValueCountFrequency (%) 
0 10542 62.0%
 
1 6458 38.0%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
15916
1
 
1084
ValueCountFrequency (%) 
0 15916 93.6%
 
1 1084 6.4%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
16533
1
 
467
ValueCountFrequency (%) 
0 16533 97.3%
 
1 467 2.7%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
13261
1
3739
ValueCountFrequency (%) 
0 13261 78.0%
 
1 3739 22.0%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
14735
1
 
2265
ValueCountFrequency (%) 
0 14735 86.7%
 
1 2265 13.3%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
16469
1
 
531
ValueCountFrequency (%) 
0 16469 96.9%
 
1 531 3.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
15364
1
 
1636
ValueCountFrequency (%) 
0 15364 90.4%
 
1 1636 9.6%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
15306
1
 
1694
ValueCountFrequency (%) 
0 15306 90.0%
 
1 1694 10.0%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
16211
1
 
789
ValueCountFrequency (%) 
0 16211 95.4%
 
1 789 4.6%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
11003
1
5997
ValueCountFrequency (%) 
0 11003 64.7%
 
1 5997 35.3%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
11836
1
5164
ValueCountFrequency (%) 
0 11836 69.6%
 
1 5164 30.4%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
16219
1
 
781
ValueCountFrequency (%) 
0 16219 95.4%
 
1 781 4.6%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
16163
1
 
837
ValueCountFrequency (%) 
0 16163 95.1%
 
1 837 4.9%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
15005
1
 
1995
ValueCountFrequency (%) 
0 15005 88.3%
 
1 1995 11.7%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
15467
1
 
1533
ValueCountFrequency (%) 
0 15467 91.0%
 
1 1533 9.0%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
15166
1
 
1834
ValueCountFrequency (%) 
0 15166 89.2%
 
1 1834 10.8%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
12016
1
4984
ValueCountFrequency (%) 
0 12016 70.7%
 
1 4984 29.3%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
15766
1
 
1234
ValueCountFrequency (%) 
0 15766 92.7%
 
1 1234 7.3%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
1
15589
0
 
1411
ValueCountFrequency (%) 
1 15589 91.7%
 
0 1411 8.3%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
16866
1
 
134
ValueCountFrequency (%) 
0 16866 99.2%
 
1 134 0.8%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
16980
1
 
20
ValueCountFrequency (%) 
0 16980 99.9%
 
1 20 0.1%
 
Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
0
16339
1
 
661
ValueCountFrequency (%) 
0 16339 96.1%
 
1 661 3.9%
 

target
Boolean

Distinct count2
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size132.9 KiB
1
8500
0
8500
ValueCountFrequency (%) 
1 8500 50.0%
 
0 8500 50.0%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Missing values

Sample

First rows

Customer_AgeDependent_countMonths_on_bookTotal_Relationship_CountMonths_Inactive_12_monContacts_Count_12_monCredit_LimitTotal_Revolving_BalAvg_Open_To_BuyTotal_Amt_Chng_Q4_Q1Total_Trans_AmtTotal_Trans_CtTotal_Ct_Chng_Q4_Q1Avg_Utilization_RatioGender_FGender_MEducation_Level_CollegeEducation_Level_DoctorateEducation_Level_GraduateEducation_Level_High SchoolEducation_Level_Post-GraduateEducation_Level_UneducatedEducation_Level_UnknownMarital_Status_DivorcedMarital_Status_MarriedMarital_Status_SingleMarital_Status_UnknownIncome_Category_$120K +Income_Category_$40K - $60KIncome_Category_$60K - $80KIncome_Category_$80K - $120KIncome_Category_Less than $40KIncome_Category_UnknownCard_Category_BlueCard_Category_GoldCard_Category_PlatinumCard_Category_Silvertarget
04533951312691.077711914.01.3351144421.6250.061010001000010000100010000
1495446128256.08647392.01.5411291333.7140.105100010000001000001010000
2513364103418.003418.02.5941887202.3330.000010010000010000010010000
3404343413313.02517796.01.4051171202.3330.760100001000000100001010000
4403215104716.004716.02.175816282.5000.000010000010010000100010000
5442363124010.012472763.01.3761088240.8460.311010010000010001000010000
65144661334516.0226432252.01.9751330310.7220.066010000001010010000001000
73202722229081.0139627685.02.2041538360.7140.048010001000000100100000010
83733652022352.0251719835.03.3551350241.1820.113010000010001000100010000
94823663311656.016779979.01.5241441320.8820.144010010000001000010010000

Last rows

Customer_AgeDependent_countMonths_on_bookTotal_Relationship_CountMonths_Inactive_12_monContacts_Count_12_monCredit_LimitTotal_Revolving_BalAvg_Open_To_BuyTotal_Amt_Chng_Q4_Q1Total_Trans_AmtTotal_Trans_CtTotal_Ct_Chng_Q4_Q1Avg_Utilization_RatioGender_FGender_MEducation_Level_CollegeEducation_Level_DoctorateEducation_Level_GraduateEducation_Level_High SchoolEducation_Level_Post-GraduateEducation_Level_UneducatedEducation_Level_UnknownMarital_Status_DivorcedMarital_Status_MarriedMarital_Status_SingleMarital_Status_UnknownIncome_Category_$120K +Income_Category_$40K - $60KIncome_Category_$60K - $80KIncome_Category_$80K - $120KIncome_Category_Less than $40KIncome_Category_UnknownCard_Category_BlueCard_Category_GoldCard_Category_PlatinumCard_Category_Silvertarget
169904814023428777.12663811628661.1183180.6065122176510.6383900.004042010000000000000000000001
169913732921213750.498031013750.4980310.380408782180.2202600.000000010000000010000000010001
16992443341221655.7728461498156.9530080.5228872027440.5231390.905183000000000001000000010001
16993501365232164.97686902164.9768690.6476802367560.6983000.000000100000000000000000010001
16994402331241714.1422231488226.1074640.3775481824340.2742200.868451010000000001000000010001
16995493362222472.65856002472.6585600.4476132032430.3934720.000000100000000000000001010001
16996462272139007.9558973338674.7535360.9370497641760.9223520.035326000001000000000000010001
169975624532314781.974577014781.9745770.798214724170.7890930.000000010000000000000100010001
16998443374333473.9887662517956.9887660.6781462547460.6273910.724777000000000010000000010001
169995424714213441.277260013441.2772600.9218544904500.3181750.000000000000000001000000000001